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(54) Apparatus and method for in-situ endpoint detection and monitoring for chemical 
mechanical polishing operations 



(57) The disclosure relates to an apparatus for 
chemical mechanical polishing (CMP) of a wafer com- 
prising a rotatable polishing patent (16) with an overly- 
ing polishing pad (18) wetted with an abrasive slurry, the 
platen being rotatably mounted to a chassis, a rotatable 
polishing head (12) for holding the wafer (14) against 
the polishing pad, the wafer comprising a semiconduc- 
tor substrate underlying an oxide layer; and an endpoint 



detector The detector comprises a laser interferometer 
(32) capable of generating a laser beam directed to- 
wards the wafer and detecting light reflected frcm the 
wafer, and a window (36) disposed adjacent a hole (30) 
formed through the platen, the window providing a path- 
way for the laser beam to impinge on the wafer at least 
during part of a penod of time when the wafer overnes 
the window. 
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Description 

This invention relates to semiconductor manufac- 
ture, and more particularly, to an apparatus and method 
for chemical mechanical polishing (CMP) and in-situ 
endpomt detection during the CMP process 

In the process o1 fabricating modern semiconductor 
integrated circuits (ICs). it is necessary to torm various 
material layers and structures over previously lormed 
layers and structures. However, the prior formations of- 
ten leave the top surface topography of an in-process 
water highly irregular, with oumps. areas of unequal el- 
evation troughs, trenches and/or other surface irregu- 
larities. These irregularities cause problems when form- 
ing the next layer. For example when printing a photo- 
lithographic pattern having small geometries over pre- 
viously formed layers, a very shallow depth of focus is 
required. Accordingly, it becomes essential to have aflat 
and planar surface, otherwise, some parts of the pattern 
will be in focus and other pans will not. In fact, surface 
variations on the order of less than 1000 A over a 25 x 
25 mm die would be preferable In addition, if the irreg- 
ularities are not leveled at each major processing step, 
the surface topography of the wafer can become even 
more irregular, causing further problems as the layers 
stack up during further processing. Depending on the 
die type and the size of the geometries involved, the sur- 
face irregularities can lead to poor yield and device per- 
formance. Consequently, it is desirable to effect some 
type of planarization, or leveling, of the IC structures. In 
fact, most high density IC fabrication techniques make 
use of some method to form a plananzed wafer surface 
at critical points in the manufacturing process 

One method for achieving semiconductor wafer 
planari7ationortopography removal is the chemical me- 
chanical polishing (CMP) process. In general, the chem- 
ical mechanical polishing (CMP) process involves hold- 
ing and/or rotating the wafer against a rotating polishing 
platen under a controlled pressure. As shown in Fig. 1, 
a typical CMP apparatus 10 includes a polishing head 
12 for holding the semiconductor wafer 14 against the 
polishing platen IS The poshing platen 15 is covered 
with a pad 1 S. This pad 1 6 typically has a backing layer 
2G which interfaces with the surface of the platen and a 
covering layer 22 which is used in conjunction wrth a 
chemical polishing slurry to polish the wafer 1 4 Howev- 
er some pads have only a covering layer and no backing 
layer The covering layer 22 is usually either an open 
cell foamed polyurethane (e.g. Rodel IC1000) or a sheet 
of polyurethane with a grooved surface (e.g. Rodel 
EX2000). The pad materia! \s wetted with the chemical 
polishing slurry containing both an abrasive and chem- 
icals One typical chemical slurry includes KOH (Potas- 
sium Hydroxide) and fumed-silica particles The platen 
is usually rotated about its central axis 24. in addition, 
the polishing head is usually rotated about its central 
axis 26, and translated across the surface of the platen 
1 6 via a translation arm 28 Although |ust one polishing 
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head is shown in Fig 1, CMP devices typically have 
more than one of those heads spacod crcumfereniialry 
around the polishing platen 

A particular problem encountered during a CMP 

5 process is in the determination that a part has been 
plananzed to a desired flatness or relative thickness In 
general, there is a need to detect when the desired sur- 
face characteristics or planar condition has been 
reached This has been accomplished in a variety of 

1 0 ways Early on, it was not possible to monitor the char- 
acteristics of the water during the CMP process Typi- 
cally, the wafer was removed from the CMP apparatus 
and examined elsewhere 11 the wafer did not meet the 
desired specifications, it had to be reloaded into the 

?5 CMP apparatus and reprocessed. This was a time con- 
suming and labor-intensive procedure Alternately the 
examination might have revealed that an excess 
amount of material had been removed rendering the 
part unusable There was, therefore, a need in the art 

20 for a device which could detect when the desired surface 
characteristics or thickness had been achieved, in-situ 
during the CMP process. 

Several devices and methods have been developed 
for the in-situ detection of endpomts during the CMP 

25 process. For instance, devices and methods that are as- 
sociated with the use of ultrasonic sound waves, and 
with the detection of changes in mechanical resistance 
electrical impedance, or wafer surface temperature, 
have been employed. These devices and methods rely 

30 on determining the thickness of the wafer or a layer 
thereof, and establishing a process endpomt by moni- 
toring the change in thickness. In the case where the 
surface layer of the wafer is being thinned, the change 
in thickness is used to determine when the surface layer 

— has the desired depth. And, in the case of pianarizmg a 
patterned wafer with an irregular surface, the endpomt 
is determined by monrtormg the change in thickness and 
knowing the approximate depth of the surface irregular- 
ities. When the change in thickness equals the depth of 

40 the irregularities, the CMP process is terminated Al- 
though these devices and methods work reasonably 
well for the applications for which they were intended, 
there is still a need for systems which provide a more 
accurate determination of the endpert 

45 The present invention is directed to an endpoint de- 
tector and methods for using it to provide improved ac- 
curacy and further useful information about the polishing 
process The apparatus and method of the present in- 
vention employ interlerometric techniques for the in-situ 

50 determination of the tnickness of material removed or 
Diananty of a wafer surface during the CMP process 

Specifically, the foregoing objective is attained by 
an apparatus*and method of chemical mechanical pol- 
ishing (CMP) employing a rotatable polishing platen with 

55 an overlying polishing pad, a rotatable polishing head 
for holding the wafer against the polishing pad, and an 
endpoint detector. The polishing pad has a backing layer 
which interfaces with the platen and a covering layer 
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which is wetted with a chemical slurry and interfaces 
with the wafer The wafer is constructed of a semicon- 
ductor substrate underlying an oxide ;ayer And, the 
erdpomt detector includes a laser interferometer capa- 
ble cf generating a laser oeam directed towards the wa- 
fer and detecting light reflected therefrom, and a window 
disposed adjacent to a hole formed through the platen 
This window provides a pathway for the laser beam to 
impinge on the wafer, at least during the time that the 
water ovehies the window 

The window can take several forms Among these 
are an insert mounted within the piaten hole. This insert 
is made of a material which is highly transmissive to the 
laser beam, such as quartz In this configuration of the 
window, an upper surface of the insert protrudes above 
a surface of the platen and extends away from the platen 
a distance such that a gap is formed between the upper 
surface of the insert and the wafer, whenever the wafer 
is held against the pad. This gap is preferably made as 
small as possible, but without allowing the insert to touch 
the wafer Alternately, the window can take the form of 
a portion of the polishing pad from which the adjacent 
backing layer has been removed This <s possible be- 
cause the pofyurethane covering layer is at least partial- 
ly transmissive to the laser beam Finally, the window 
can take the form of a plug formed in the covering layer 
of the pad and having no backing layer This plug is pref- 
erably made of a polyurethane material which is highly 
transmissive to the laser beam. 

In one embodiment of the present invention, the 
hole through the platen, and the window, are circular in 
shape In another, the hole and window are arc-shaped 
The arc-shaped window has a radius with an origin co- 
incident to the center of rotation of the platen. Some em- 
bodiments of the invention also have a laser beam 
whose beam diameter, at its point of impingement on 
the wafer, is significantly greater than the smallest di- 
ameter possible for the wavelength employed. 

The CMP apparatus can also include a position 
sensor for sensing when the window is adjacent the wa- 
fer. This ensures that the laser beam generated by the 
laser interferometer can pass unblocked through the 
window and impinge on the wafer. In a preferred em- 
bodiment of the invention, tne sensor includes a f! aq at- 
tached along a portion of the periphery or the platen 
which extends radially outward therefrom In addition, 
the r e is an optical interrupter-type sensor mounted to 
the chassis at the periphery cf the platen This sensor 
is capable of producing an optical beam which causes 
a signal to be generated for as long as the optical beam 
:s interrupted by tne fiag. Thus, the flag is attached to 
the periphery of the platen in a position sucn that ihe 
optical beam is interrupted by the flag, whenever the la- 
ser beam can be made to pass unblocked through the 
window and impinge on the wafer 

Further, the laser interferometer includes a device 
for producing a detection signal whenever light reflected 
from the wafer is detected, and the posrtion sensor in- 



cludes an element for cutputting a scrsirg signal when- 
over the window :s adjacent the wafer Ths allows a data 
acquisition dev ce to sample the detection signal from 
:he laser interferometer for the duration of the sensing 

5 signal from the posrtion sensor The data acquisition de- 
vice then employs an element for outputting a data sig- 
nal representing the sampled detection signal This data 
acquisition device can also include an element lor inte- 
grating the sampled detection signal from the laser mter- 

■ 0 ferometer over a predetermined period of time such that 
the output is a data signal representing the integrated 
samples of the detection signal in cases where the 
aforementioned predetermined sampie period cannot 
be obtained during only one revolution of the platen, an 

* 5 alternate method of piece-wise data acquisition can be 
employed. Specifically, the data acquisition device can 
include elements for performing the method of sampling 
the detection signal output from the laser interferometer 
during each complete revolution of the piaten for a sam- 

20 pie time, integrating each sampie of ihe detection signal 
over the sample time to produce an integrated value cor- 
responding to each sample, and storing each integrated 
vaiue The data acquisition device then uses other ele- 
ments for computing a cumulative sample tirne after 

25 each complete revolution of the platen (where the cu- 
mulative sample time is the summation of the sample 
times associated with each sample of the detection sig- 
nal), comparing the cumulative sample time to a desired 
minimum sample time, and transferring the stored inte- 

30 grated values from the storing element to the element 
for calculating a summation thereof, whenever the cu 
mulative sample time equals or exceeds the predeter- 
mined minimum sample time Accordingly, the afore- 
mentioned output is a data signal representing a series 

35 of the integrated value summations from the summation 
element. 

The data signai output by the data acquisition de- 
vice is cyclical due to the interference between the por- 
tion of the laser beam reflected from the surface of the 
40 oxide layer of the wafer and the portion reflected from 
the surface of the underlying wafer substrate, as the ox- 
ide layer is thinned during the CMP process. According- 
ly, the endpomt in a CMP process tc thin the oxide layer 
of a biank oxide wafer can be determined using addl- 
es t;or,a! apparatus elements tor counting a number of cy- 
cles exhibited by the data signal, computing a thickness 
of material removed during one cycle of theoutput signal 
from the wavelength of the laser beam and the index of 
refraction of the oxide layer of the wafer, comparing a 
50 desired thickness of material to be removed from the 
oxide layer to a removed thickness comprising the prod- 
uct of the numher of cycles exhibited by the data signal 
and the thickness of material removed during one cycle, 
and terminating the CMP whenever the removed thick- 
55 ness equals or exceeds the desired thickness ot mate- 
rial to be removed Alternately, instead of counting com- 
plete cycles, a portion of a cycle could be counted The 
procedure is almost identical except that the th ckness 
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of material removed is determined for the portion of the 
cycle, rather than for an entire cycle. 

An alternate way of determining the endpomt in 
CMP processing of a blank oxide wafer uses apparatus 
elements which measure the time required for the data 
signal to complete either a prescribed number of cycles 
or a prescribed portion of one cycle, compute the thick- 
ness of material removed during the time measured, cal- 
culate a rate of removal by dividing the thickness of ma- 
te r ial removed by the time measured ascertain a re- 
maining removal thickness by subtracting the thickness 
o* material removed from a desired thickness of material 
to be removed from the oxide layer, establish a remain- 
ing CMP time by dividing the remaining removal thick- 
ness by the rate of removal, and terminate the CMP 
process after the expiration of the remaining CMP time. 

In addition, this remaining CMP time can be updat- 
ed after each occurrence of the aforementioned number 
of cycles or portions thereof, to compensate for any 
changes in the material removal rate In this case the 
procedure is almost identical except that ascertaining 
the thickness of the material involves first summing all 
the thicknesses removed in earlier iterations and sub- 
tracting this cumulative thickness from the desired thick- 
ness to determine the remaining removal thickness fig- 
ure. 

However, when the wafer has an initially irregular 
surface topography and is to be planarized during the 
CMP process, the data signal is cyclical only after the 
wafer surface has become smooth fn this case an end- 
point to the CMP process corresponding to a determi- 
nation that the wafer has been planarized is obtained by 
employing additional apparatus elements tor detecting 
a cyclic variation in the data signal, and terminating the 
CMP whenever the detecting element detects the cyclic 
variation. Preferably, the detecting element is capable 
of detecting a cyclical variation in the data signal within 
at most one cycle of the beginning of this variation 

In some circumstances, it is desirable to control the 
film thickness overlying a structure on a patterned wafer. 
This film thickness cannot always be achieved through 
the aforementioned planarization However, this control 
can still be obtained by filtering the data signal to ex- 
clude ali frequencies other than that associated with the 
particular structure, or group cf simi!a r !y sized sta- 
tures, over which a specific film thickness is desired Es- 
sentially, once the signal has been filtered any of the 
previously summarized ways of determining a CMP 
endpoint for a blank oxide wafer can be employed on 
the patterned wafer 

The data signal output by the data acquisition de- 
vice can also be advantageously employed for things 
otherthan determining the endpoint of an ongoing CMP 
process Therefore, in another aspect, the invention is 
an m-situ method of measuring uniformity of a layer on 
a substrate during polishing of said layer The method 
includes the steps of directing a light beam toward the 
iayer during polishing; monitoring an interference signal 



produced by the light Deam reflecting ofl of the sub- 
strate, and computing a measure of uniformity from the 
.nterference signal. 

In preferred embodiments, the computing step in- 

5 eludes extracting feature information from the interfer- 
ence signal; and computing the measure of uniformity 
from the extracted feature information The method also 
includes the steps of comparing the measure of uniform- 
ity to a reference and generating an alert when the 

*o measure of uniformity diverges from the reference by 
more than a predetermined amount. The interference 
signal includes a low frequency component and the ex- 
tracting step includes measuring a first characteristic of 
the low frequency component; and deriving the extract- 
's ed information from the first characteristic In fact, the 
interferometer signal also includes a high frequency 
component and the extracting step further includes 
measuring a second characteristic of the high frequency 
signal and deriving the extracted information the said 

20 first and second characteristics More specifically, the 
first and second characteristics are amplitudes of the 
high and low frequency signals, respectively, and the 
deriving step involves computing a ratio of the ampli 
tudes of the high and low frequency signals 

25 in general, in another aspect, the invention is an m- 
situ method of characterizing a process for polishing a 
substrate having a layer formed thereon The method 
includes the steps of directing a light beam toward the 
layer during polishing; monitoring an interference signal 

30 produced by the light beam reflecting off of the sub- 
strate; extracting a signature from the interference sig- 
nal; comparing the extracted signature with stored infor- 
mation that represents a desired operating point for the 
polishing process, and when the extracted signature di- 

35 verges from the stored information by more than a pre- 
determined amount, generating an aiert signai. 

Preferred embodiments include the following fea- 
tures. The method further includes displaying the inter- 
ference signal on a visual display device for an operator 
to view In addition, the step of extracting includes the 
steps of determining a polishing rate from the interfer- 
ence signal; and determining a measure of uniformity 
f rom the interference signal The comparing step in- 
volves comparing the polishing rate and the measure of 

a$ 'jnifo r mity to the stored information 

!n general, in yet another aspect, the invention is a 
substrate polish ;ng system including a platen which dur 
mg processing holds a polishing pad, a polishing head 
which during processing holds the substrate against the 

50 polishing pad on the platen: an interferometer capable 
of generating a collimated light beam which during 
process^ is directed at and impinges on a side of the 
substrate that is being polished and generates an inter- 
ferometer signal; and a data processor programmed to 

55 compute a measure of uniformity from the interference 
signai 

in preferred embodiments, the polishing head and 
the platen are rotatabie so that they both rotate during 
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processrg Also, the interferometer is a laser interfer- 
ometer Further the data processor is also programmed 
to extract feature information 1rom the interference sig- 
nal, and compute the measure of uniformity from the ex- 
tracted feature information The data processor is also s 
programmed to compare the measure of uniformrty to a 
reference; and generate an alert when the measure of 
uniformity diverges from the reference by more than a 
predetermined amount. 

In general, in still yet another aspect the invention 1 ° 
■s a substrate polishing apparatus which includes a plat- 
en which during processing holds a polishing pad; a pol- 
ishing head which during processing holds the substrate 
against the polishing pad on the platen, and an interfer- 
ometer capable of generating a collimated light beam, is 
The platen includes a passage therethrough and the 
interferometer is arranged to direct its light beam at least 
during part of the polishing operation through that pas- 
sage so that it impinges on the substrate The apparatus 
also includes a window aligned with the passage and 20 
through which the light beam passes on its way to the 
substrata. The window has a diffuse surface receiving 
the arriving light beam from the interferometer. 

In general, in another aspect, the invention is a pol- 
ishing pad for use in the above-described chemical me- 25 
chanical polishing system. The polishing pad includes 
a polishing surface; and a bottom surface The bottom 
surface and the polishing surface of the polishing pad 
includes a window formed therein. The window is trans- 
missive to light from an interferometer and has a diffuse 30 
bottom surface. 

In addition to the just described benefits, other ob- 
jectives and advantages of the present invention will be- 
come apparent from the detailed description which fol- 
lows hereinafter when taken in conjunction with the 
drawing figures which accompany it. 

The specific features, aspects, and advantages of 
the present invention will become better understood 
with regard to the following description, appended 
claims, and accompanying drawings where; *o 

FIG 1 is a side view of a chemical mechanical pol- 
•shing (CMP) apparatus typical of the prior art 

FIG 2 is a side view of a chemical mechanical poi- 
<shmg apparatus with endpoint detection cons tr ucted in 
accordance with the present invention 4 s 

FIGS 3A-C are simplified cress-sectional views of 
respective embodiments of the window portion of the 
apparatus of Fig 2 

FIG. 4 is a simplified cross-sectional view of a win- 
dow portion of the apparatus ot Fig. 2, showing compo- so 
nents of a !ase r interferometer generating a laser beam 
and detecting a reflected interference beam. 

FIG 5 is a simplified cross-sectionat view of a blank 
oxide wafer being processed by the apparatus of Fig. 2, 
schematically showing the laser beam impinging on the 55 
wafer and reflection beams forming a resultant interfer- 
ence beam. 

FIG 6 is a simplified top view of the platen of the 



apparatus of Fig 2 showing one possible relative ar- 
rangement between the window and sensor flag, and 
the sensor and laser interferometer 

FIG 7 is a top view of the platen of the apparatus 
of Fig 2, showing a relative arrangement between the 
window and sensor flag, and the sensor and laser 
where the window is in the shape of an arc 

FIG 8 is a flow chart of a method of pieco-wtse data 
acquisition in accordance wrth the present invention 

FIGS. 9A-B are graphs showing the cyclic variation 
m the data signal from the laser interferometer over time 
during the thinning of a blank oxide wafer The graph of 
Fig 9A shows the integrated values of the data signal 
integrated over a desired sample lime, and the graph of 
Fig. 9B shows a filtered version of the integrated values. 

FIG. 1 0A is a block diagram of a backward-looking 
method of determining the endpoint of a CMP process 
to thin the oxide layer of a blank oxide wafer in accord- 
ance with the present invention 

FIG 10B is a block diagram of a forward-looking 
method of determining the endpoint of a CMP process 
to thin the oxide layer of a blank oxide wafei m nuuuid- 
ance with the present invention. 

FIGS. 11 A-c are simplified cross-sectiona! views of 
a patterned wafer with an irregular surface being proc- 
essed by the apparatus of Fig 2 wherein Fig 11 A 
shows the wafer at the beginning of the CMP process. 
Fig. 11 B shows the wafer about midway through the 
process, and Fig 11 C shows the wafer close to the point 
of planarization. 

FIG. 12 is a flow chart diagram of a method of de- 
termining the endpoint of a CMP process to planarize a 
patterned wafer with an irregular surface in accordance 
with the present invention. 

FIG. 1 3 is a graph showing variation in the data sig- 
nal from the laser interferometer over time during the 
planarization of a patterned wafer. 

FIG. 14 is a block diagram of a method of determin- 
ing the endpoint of a CMP process to control the film 
thickness overlying a particularly sized structure, or 
group of similarly sized structures, in accordance with 
:he present invention. 

FIG. 15A is a simplified cross-sectionai view of a 
water w'th a surface imperfection being nominated by 
a narrow-diameter laser beam 

FIG 15B is a simplified cross-sectional view cf a 
wa!e r with a surface imperfection being illuminated by 
a wtde-d<ameler iaser beam 

FIG 16 is a graph shewing 1he cyclic variation in 
the data signal from the laser interferometer over time 
during the thinning cf a blank oxide wafer 3nd including 
the high frequency signal associated with a nonuniform 
wafer surface 

FIG. 1 7 is a schematic representation of a CMP sys- 
tem including an interferometer and a computer pro- 
grammed to analyze and respond to the output signal 
ot interferometer waveform. 

FIG 1 8 is a block diagram showing the functionality 
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that is implemented within the computer to perform m- 
situ monitoring of uniformity. 

FIGS 1 9(a)-(c) show examples of an interferometer 
signal, the interferometer signal after it has been filtered 
by a low frequency bandpass pass filter, and the inter- 
ferometer signal after it has been filtered by a high fre- 
quency bandpass pass filter, respectively 

FIG 20(a) -(b) are flow charts showing the proce- 
dure for generating and then using a signature of a CMP 
system to qualify it for production use. 

F!G 21(a) is simplified crcss-sectiona! view of an 
embodiment of the window portion of the apparatus of 
Fig 2 employing the polishing pad as the window, and 
showing a reflection from the backside of the pad. 

FIG. 21 (b) is a graph showing the cyclical variation 
in the data signal from the laser interferometer over time 
with a large DC component caused by the reflection 
from the backside of the pad of the embodiment of Fig. 
21(a). 

FIG 21(c) is simplified cross-sectional view of an 
embodiment of the window portion of the apparatus of 
Fig 2 employing the polishing pad as the window with 
a diffused backside surface to suppress reflections. 

FIG 21 (d) is a graph showing the cyclical variation 
in the data signal from the laser interferometer over time 
without the large DC component caused by reflection 
from the backside of the pad as a result of the diffuse 
backside surface of the embodiment of Fig. 21(c). 

Fig 2 depicts a portion of a CMP apparatus modi- 
fied in accordance with one embodiment of the present 
invention. A hole 30 is formed in the platen 16 and the 
overlying platen pad 18 This hole 30 is positioned such 
that it has a view of the wafer 14 held by a polishing 
head 1 2 during a portion of the platen's rotation, regard- 
less of the translational position of the head 12. A laser 
interferometer 32 is fixed below the platen 1 6 in a posi- 
tion enabling a laser beam 34 projected by the laser 
interferometer 32 to pass through the hole 30 in the plat- 
en 16 and strike the surface of the overlying wafer 14 
during a time when the hole 30 is adjacent the wafer 1 4. 

A detailed view of the platen hole 30 and wafer 14 
(at a time when it overlies the platen hole 30) are shown 
tn Figs. 3A-C. As can be seen in Ha 3A. the platen hole 
3C has a stepped diameter, thus forming a shoulder 36. 
The shoulder 36 is used to contain and hold a quartz 
insert 36 which functions as a window for the laser beam 

34 The interface between the platen 16 and the insert 

35 is sealed, so that the portion of the chemical s!urry 
40 finding its way between the wafer 14 and insert 38 
cannot leak through to the bottom of the platen 1 6. The 
quartz insert 38 protrudes above the top surface of the 
platen 16 and partially into the platen pad 1 8. This pro- 
trusion of the insert 33 is intended to minimize the gap 
between the top surface of the insert 33 and the surface 
of the wafer 14 By minimizing this gap, the amount of 
slurry 40 trapped in the gap is minimized. This is advan- 
tageous because the slurry 40 tends to scatter light 
traveling through it thus attenuating the laser beam 



omitted from tne laser nterfercmcler 32 The thinner the 
layer of slurry 40 between the insert 36 and the wafer 
14 the less the laser beam 34 and light reflected from 
The wafer, is attenuated, it is believed a gap ct approx- 
s imately 1 mm would result in acceptable attenuation val- 
ues during the CMP process However, it is preferable 
to make this gap even smaller The gap should be made 
as small as possible while still ensuring the insert 38 
does not touch the wafer 14 at any t:me during tne CMP 

to process In a tested embodiment of the present inven- 
tion the gap between the insert 38 and wafer 14 was 
set at 10 mils (250 jam) with satisfactory results 

Fig. 3B shows an alternate embodiment of the plat- 
en 16 and pad 18. In this embodiment, the quartz insert 

*5 has been eliminated and no through-hole exists in the 
pad 18. Instead, the backing layer 20 (if present) of the 
pad 1 8 has been removed in the area overlying the hole 
30 in the platen 16. This leaves only the polyurethane 
covering layer 22 of the pad 18 between the wafer 14 

20 and the bottom of the platen 16 It has been found that 
the polyurethane materia! used in the covering layer 22 
wiil substantially transmit the laser beam 34 from the la- 
ser interferometer 32. Thus, the portion of the covering 
layer 22 which overlies the platen hole 30 functions as 

25 a window for the laser beam 34, This alternate arran ge- 
ment has significant advantages. First, because the pad 
18 itself is used as the window, there is no appreciable 
gap. Therefore, very little of the slurry 40 is present to 
cause the detrimental scattering of the laser beam An- 

30 other advantage of this alternate embodiment ts that pad 
wear becomes irrelevant. In the first-described embod- 
iment of Fig 3A ; the gap between the quartz insert 38 
and the wafer 14 was made as small as possible How- 
ever, as the pad 13 wears, this gap tends to become 

35 even smaller Eventually, the wear could become so 
great that the top surface of the insert 38 would touch 
the wafer 14 and damage it. Since the pad 13 is used 
as the window in the alternate embodiment of Fig 3B, 
and is designed to be in contact with the wafer 1 4, th ere 

40 are no detrimental effects due to the wearing of the pad 
18. It is noted that tests using both the open-cell and 
grooved surface types of pads have shown that the laser 
beam is less attenuated with the grooved surface Dad. 
Accordingly it is preferable that this type of pad be arn- 

45 ployed 

Although the polyurethane material used in the cov- 
ering layer of the pad is substantially transmissive to the 
laser beam, it does contain certain additives which in- 
hibit its transmissiveness. This problem is eliminated in 

50 the embodiment of the invention depicted in Fiq. 3C. In 
this embodiment, the typical pad material in the region 
overlying the platen hole 30 has been replaced with a 
solid polyurethane plug 42 This plug 42, which func- 
tions as the window for the laser beam, is made of a 

55 polyurethane material which lacks the grooving (or 
open-cell structure) of the surrounding pad material, and 
is devoid of the additives which inhibit transmissiveness 
Accordingly the attenuation ol the iaser beam 34 
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through the plug 42 is minimized. Preferably, the plug 
42 is integrally molded into the pad 18 

in operation, a CMP apparatus in accordance with 
the present invention uses the laser beam from the laser 
interferometer to determine the amount of material re- 
moved from the surface of the wafer, or to determine 
when the surface has become planarized The begin- 
ning of this process wi ! be explained in reference to Fig 
4 It is noted that a laser and collimator 44. beam splitter 
46 and detector 48 are depicted as elements of the la- 
ser interferometer 32 This ts done to facilitate the afore- 
mentioned explanation of the operation of the CMP ap- 
paratus In addition, the embodiment of Fig 3A employ- 
ing the quartz insert 38 as a window is shown for con- 
venience. Of course, the depicted configuration is just 
one possible arrangement, others can be employed For 
instance, any of the aforementioned window arrange- 
ments could be employed, and alternate embodiments 
of the laser interferometer 32 are possible One alter- 
nate interferometer arrangement would use a laser to 
produce a beam which is incident on the surface of the 
water at an angle In this embodiment, a detector would 
be positioned at a point where tight reflecting from the 
wafer would impinge upon it. No beam splitter would be 
required in this alternate embodiment. 

As illustrated in Fig 4, the laser and collimator 44 
generate a coilimated laser beam 34 which is incident 
on the lower portion of the beam splitter 46. A portion of 
the beam 34 propagates through the beam splitter 46 
and the quartz insert 38. Once this portion of beam 34 
leaves the upper end of the insert 38, it propagates 
through the slurry 40, and impinges on the surface of 
the wafer 14. The waler 14, as shown in detail in Fig 5 
has a substrate 50 made of silicon and an overlying ox- 
ide layer 52 (i.e. SiO z ). 

The portion of the beam 34 which impinges on the 
water 14 will be partially reflected at the surface of the 
oxide layer 52 to form a first reflected beam 54. Howev- 
er a portion of the light will also be transmitted through 
the oxide layer 52 to form a transmitted beam 56 which 
impinges on the underlying substrate 50 At least some 
of the light from the transmitted beam 56 reaching the 
substrate 50 will be reflected back through the oxide lay- 
er 52 to form a second reflected beam 58. The first and 
second reflected beams 54 58 interfere with each other 
constructivery or destructively depending on their phase 
relationship, to form a resuttant beam 60, where the 
phase relationship is primarily a function of the thickness 
of the oxide layer 52. 

Although, the above-described embodiment em- 
ploys a silicon substrate with d single oxide iayer : ihose 
skilled in the art will recognize the interference process 
would also occur with other substrates and other oxide 
layers The key is that the oxide layer partially reflects 
and partially transmits, and the substrate at least par- 
tially reflects, the impinging beam In addition the inter- 
ference process may also be applicable to wafers with 
multiple layers overiyinq the substrate. Again, if each 



layer is partially reflective and partially transmissive- a 
resultant interference beam will be created although it 
will be a combination of the reflected beams from all the 
layer and the substrate 

5 Referring again to Fig 4, it can be seen the resultant 

beam 60 representing the combination of the first and 
second reflected beams 54 58 (Fig 5) propagates back 
through the slurry 40 and the insert 38, to the upper por- 
tion of the beam splitter 46 The beam splitter 46 diverts 

io a portion of the resultant beam 60 towards the detector 
48. 

The platen 16 will typically be rotating during the 
CMP process Therefore, the platen hole 30 will onfy 
have a view of the wafer 14 during part of its rotation 

*5 Accordingly, the detection signal from the laser interfer- 
ometer 32 should only be sampled when the wafer 14 
is impinged by the laser beam 34. It is important that the 
detection signal not be sampled when the laser beam 
34 is partially transmitted through the hole 30, as when 

20 a portion is blocked by the bottom of the platen 1 6 at the 
hole's edge, because this wi!! cause considerable noise 
m the signal. To prevent this from happening, a position 
sensor apparatus has been incorporated Any well 
known proximity sensor could be used, such as Hall ef- 

2S feet, eddy current, optical interrupter, or acoustic sensor, 
although an optical interrupter type sensor was used in 
the tested embodiments of the invention and will be 
shown in the figures that follow. An apparatus accord- 
ingly to the present invention for synchronizing the laser 

30 interferometer 32 is shown in Fig. 6, with an optical in- 
terrupter type sensor 62 (e.g. LED/photodtode pair) 
mounted on a fixed point on the chassis of the CMP de- 
vice such that it has a view of the peripheral edge of the 
platen 16 This type of sensor 62 is activated when an 

35 optical beam it generates is interrupted A position sen- 
sor flag 64 is attached to the periphery of the platen 1 6. 
The point of attachment and length of the flag 64 is made 
such that it interrupts the sensor's optical signal only 
when the laser beam 34 from the laser interferometer 

40 32 ts completely transmitted through the previously-de- 
scribed window structure 66 P-or example as shown in 
Ftg. 6, the sensor 62 could be mounted diametrically op- 
posite the laser interferometer 32 v. relation to the center 
of tne platen 16. The flag 64 would be attacheo to the 

45 piaten 1 6 in a position diametrically opposite the window 
structure 66 The length of the flag 64 would be approx- 
imately defined by the dotted lines 68„ although, tne cx 
act length of the fiag 64 would be fine tuned to ensure 
the laser beam is completely unblocked by the platen 

50 16 during the entire time the flag 64 is sensed hy thp 
sensor 62. This fine tuning wouia compensate tor any 
position sensor noise or inaccu racy, the responsiveness 
of the laser interferometer 32, etc Once the sensor 62 
has been activated a signal is generated which is used 

55 to determine when the detector signal from the interfer- 
ometer 32 is to be sampled 

Data acquisition systems capable of using the po- 
sition sensor signal to sample the laser interferometer 
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signal during those times when the water is visible to the 
iaser beam, are well known m the art and do net form a 
novel part of the present invention. Accordingly a de- 
tailed description will not be given herein However 
seme considerations should be taken into account in 
choosing an appropriate system. For example, it is pre- 
ferred that the signal from the interferometer be integrat- 
ed over a period of time. This integration improves the 
signal-to-noise ratio by averaging the high frequency 
no se over the integration period This noise has various 
causes, such as vibration from the rotation of the platen 
and wafer and variations in the surface of the wafer due 
to unequal planarization In the apparatus described 
above the diameter of the quartz window, and the speed 
of rotation of the platen, will determine how long a period 
of time is available during any one rotation of the platen 
to integrate the signal However, under some circum- 
stances, this available time may not be adequate. For 
instance an acceptable sianaUto-noise ratio might re- 
quire a ionger integration time, or the interface circuitry 
employed in a chosen data acquisition system may re- 
quire a minimum integration tim.9 which exceeds that 
which is available tn one pass 

One solution to this problem is to extend the platen 
hoie along the direction of rotation of the platen In other 
words, the window structure 66' (i.e. insert pad, or plug) 
would take on the shape of an arc, as shown in Fig 7 
Of course, the flag 64' is expanded to accommodate the 
longer window structure 66' Alternately, the window 
could remain the same but the laser interferometer 
would be mounted to the rotating platen directly below 
the window In this case the CMP apparatus would have 
to be modified to accommodate the interferometer be- 
low the platen, and provisions would have to be made 
to route the detector signal from the interferometer. 
However, the net result of either method would be to 
lengthen the data acquisition time for each revolution of 
the platen. 

Although lengthening the platen hole and window 
is advantageous it does somewhat reduce the surface 
area of the platen pad. Therefore, the rate of planariza- 
tion is decreased in the areas of the disk which overlie 
the window during a portion of the platen's rotation in 
addition, the length ot the platen hole and window must 
ret extend beyond the edges cf the wafer and the data 
sampling must not be done when the window is beyond 
the edge of the wafer, regardless of the wafer's transia- 
ticnal position. Therefore, the length of the expanded 
platen hole and window, or the time which the platen- 
mounted interferometer can be sampled, is limited by 
any translational movement of the polishmq head. 

Accordingly, a more preferred method ot obtaining 
adequate data acquisition integration time is to collect 
the data over more than one revolution of the platen. In 
reference to Fig. 6. during step 102, the laser interfer- 
ometer signal is sampled during the available data ac- 
quisition time in each rotation ot the platen. Next, in 
steps 104 and 106, each sampled signal is integrated 



over the aforementioned data acquisition time, and the 
integrated values are stored Then in steps 1 08 and 
110 a cumulative sample time is computed after each 
complete revolution of the platen and compared to a de- 

$ sired minimum sample time Of course this would con- 
stitute only one sample time if only one sample has been 
taken If the cumulative sample time equals or exceeds 
the desired minimum sample time, then the stored inte- 
grated values are transferred and summed as shown 

•o >n step 112 if not, the process of sampling, integrating, 
storing, computing the cumulative sample time, and 
comparing it to the desired minimum sampie time con- 
tinues In a final step 1 1 4. the summed integrated values 
created each time the stored integrated values arc 

'5 transferred and summed, are output as a data signal. 
The just-described data collection method can be imple- 
mented in a number of well known ways, employing ei- 
ther logic circuits or software algorithms. As these meth- 
ods are well known, any detailed description would be 

20 redundant and so has been omitted it is noted that the 
method of piece-wise data collection provides a solution 
to the problem of meeting a desired minimum sample 
time no matter what the diameter of the window or the 
speed of platen rotation In fact, if the process is tied to 

25 the position sensor apparatus, the platen rotation speed 
could be varied and reliable data would still be obtained 
Only the number of platen revolutions required to obtain 
the necessary data would change. 

The aforementioned first and second reflected 

w beams which formed the resultant beam 60, as shown 
in Figs. 4 and 5, cause interference to be seen at the 
detector 48 If the first and second beams are in phase 
with each other, they cause a maxima on detector 46 
Whereas, if the beams are 130 degrees out of phase, 

35 they cause a minima on the detector 48. Any other 
phase relationship between the reflected beams will re- 
sult in an interference signal between the maxima and 
minima being seen by the detector 48. The result is a 
signal output from the detector 48 that cyclically varies 

40 with the thickness of the oxide layer 52, as it is reduced 
during the CMP process. In fact, it has been observed 
that the signal output from the detector 49 w;l! vary in a 
biftusoidai-iike manner, as shown in the graphs of Figs. 
Q A-B T he graph of Fig. 9A shows the integrated ampii- 

4$ tube of the oetector signal (y-axis) over each sample pe- 
riod versus time (x-axis). This data was obtained by 
monitoring the laser interferometer output of the appa- 
ratus of Fig. 4, while performing the CMP procedure on 
a wafer having a smooth oxide iayer overlying a silicon 

50 substrate (i.e a blank oxide wafer) The graph of Fig. 
9B represents a filtered ve r sion of the data from the 
qrapn of Fig 9A This filtered version shows the cyclical 
variation in the interferometer output signal quite clearly 
It should be noted that the period of the interference sig- 

55 nal is controlled by the rate at which material is removed 
from the oxide layer during the CMP process Thus, fac- 
tors such as the downward force placed on the wafer 
against the platen pad, and the relative velocity between 
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ths platen and the wafer determine the period During 
each period of the output signal plotted m Figs 9A-B. a 
certain th ! ckness of the oxide layer is removed The 
thickness removed is proportional to the wavelength of 
the laser beam and the index of refraction of the oxide 
layer Specifically the amount of Thickness removed per 
period is approximately V2n where k is the freespace 
wavelength of the laser beam and n is the index of re- 
fraction of the oxide layer Thus, it is possible to deter- 
mine how much of the oxide layer is removed in-situ, 
during the CMP process using the method illustrated in 
Fig 10A. First m step 202, the number of cycles exhib- 
ited by the data signal are counted Next, in step 204. 
the thickness of the material removed during one cycle 
of the output signal is computed from the wavelength of 
the laser beam and the index of refraction of the oxide 
layer of the wafer. Then, the desired thickness of mate- 
ria! to be removed from the oxide layer is compared to 
the actual thickness removed, in step 206. The actual 
thickness removed equals the product of the number of 
cycles exhibited by the data signal and the thickness of 
material removed dur ing one cycle. In the final step 208, 
the CMP process is terminated whenever the removed 
thickness equals or exceeds the desired thickness of 
material to be removed. 

Alternately, less than an entire cycle might be used 
to determine the amount of material removed In this 
way any excess material removed over the desired 
amount can be minimized. As shown in the bracketed 
portions of the step 202 in Fig. 10A, the number of oc- 
currences of a prescribed portion of a cycle are counted 
in each iteration. For example, each occurrence of a 
maxima (i e peak) and minima (i e valley), or vice ver- 
sa, would constitute the prescribed portion o1 the cycle. 
This particular portion of the cycle is convenient as maxi- 
ma and minima are readily detectable via well know sig- 
nal processing methods. Next, in step 204, after deter- 
mining how much material is removed during a cycle, 
this thickness is multiplied by the fraction of a cycle that 
the aforementioned prescribed portion represents. For 
example in the case of counting the occurrence of a 
maxima and minima which represents one-half of a cy- 
cle, the computed one-cycle thickness would be multi- 
plied by onc-haff tc obtain the thickness cf the oxide lay 
er removed during tne prescribed oortion of the cycle 
The remaining steps in the method remain unchanged 
The net result of this alternate approach is that the CMP 
process can be terminated after the occurrence of a por- 
tion of the cycie. Accordingly, any excess material re- 
moved will, m most cases, be less than it wouid have 
hppn if p full eye'e whe r 9 used as the basis f o r deter- 
mining the amount ot material removed. 

The just-described methods look back from the end 
ot a cycle, or portion thereof, to determine if the desired 
amount of material has been removed. However, as in- 
ferred above, the amount of material removed might ex- 
ceed the desired amount. In some applications, this ex- 
cess removal of material might be unacceptable In 



these cases, an alternate method can be employed 
which looks forward and anticipates how much material 
will be removed over an upcoming pei.od of time and 
terminates the procedure when the desired thickness is 

s anticipated to have been removed A preferred embod- 
iment of this alternate method is illustrated in Fig 10B 
As can be seen, the first step 302 involves measuring 
the time between the first occurrence of a maxima and 
minima or vice versa in the detector signal (although 

1 0 an entire cycle or any portion thereof could have Deen 
employed). Next in step 304, the amount of material re- 
moved during that portion of the cycie is determined via 
the previously described methods A removal rate is 
then calculated by dividing the amount of material re 

*5 moved by the measured time, as shown in step 306. 
This constitutes the rate at which material was removed 
in the preceding portion ot the cycle. In the next step 
303, the thickness of the material removed as calculated 
■n step 304 is subtracted from the desired thickness to 

20 be removed to determine a remaining removal thick- 
ness Then, in step 310, this remaining removal thick 
nasb is divided by the removal rale lo determine how 
much longer the CMP process is to be continued before 
its termination. 

25 it must be noted, however that the period of the de- 
tector signal, and so the removal rate, will typically vary 
as the CMP process progresses Therefore, the above- 
described method is repeated to compensate for this. In 
other words, once a remaining time has been calculat- 

30 ed, the process is repeated for each occurrence of a 
maxima and minima, or vice versa. Accordingly, the time 
between the next occurring maxima and minima is 
measured, the thickness of material removed during the 
portion of the cycle represented by this occurrence of 

35 the maxima and minima (i.e. one-half) is divided by the 
measured time, and the removal rate is calculated, just 
as in the first iteration of the method. However, in the 
next step 30B, as shown in brackets, the total amount 
of material removed during all the previous iterations is 

40 determined before being subtracted from the desired 
thickness. The rest of the method remains the same in 
that the remaining thickness tc bo romovod is divided 
oy the newly calculated removal rate to determine the 
"omaining CMP process time !n this way the remaining 

45 process time is recalculated atter each occurrence ot 
the prescribed portion of a cycie of the detector signal 
This process continues until the remaining CMP proc 
ess time will expire before the next iteration can begin 
At that point the CMP process is terminated, as seen in 

so step 31 2 Typically the thickness to be removed will not 
be accom n !'shed m the first one-half c w c!e of the detec- 
tor signal and any variation in the removal rate after be- 
ing calculated for the preceding one-half cycle will be 
small. Accordingly, it is believe this forward-looking 

55 method will provide a very accurate way of removing just 
the desired thickness from the wafer 

While the just-described monitoring procedure 
works well for the smooth-surfaced blank oxide wafers 
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being thinned, it has boon found that the procedure can- 
rot be successfully used to planarize most patterned 
waters where the surface topography is highly irregular 
The reason for this is that a typical patterned wafer con- 
ta.ns dies which exhibit a wide variety of drtferently sized 
surface features These differently sized surface fea- 
tu r es tend to polish at different rates For example, a 
smaller surface feature located relatively far from other 
featurestendstobe reduced fasterthan other larger fea- 
tj'es Fig 1 1 A-C exemplify a set ct surface features 72. 
74 75 of the oxide layer 52 associated with underlying 
structures 79, 80 82, that might be found on a typical 
patterned wafer 14, and the changes they undergo dur- 
ing the CMP process Feature 72 is a relatively small 
feature, feature 74 is a medium sized feature, and fea- 
ture 76 is a relatively large feature. Fig 11 A shows the 
features 72, 74, 76 before polishing, Fig 11 B shows the 
features 72, 74, 76 about midway through the polishing 
process and Fig 1 1 C shows the features 72 74 76 to- 
wards the end of the polishing process. In Fig 11 A. the 
smaller feature 72 will be reduced at a faster rate than 
either the medium or large features 74, 76. !n addition, 
the medium feature 74 will be reduced at a faster rate 
than the large feature 76. The rate at which the features 
72, 74, 76 are reduced also decreases as the polishing 
process progresses For example, the smaller feature 
72 will initially have a high rate of reduction, but this rate 
will drop off during the polishing process. Accordingly, 
Fig 11 B shows the height of the features 72, 74. 76 
starting to even out, and Fig. 11 C shows the height of 
the features 72, 74, 76 essentially even. Since the dif- 
ferently sized features are reduced at different rates and 
these rates are changing, the interference signal pro- 
duced from each feature will have a different phase and 
frequency Accordingly, the resultant interference sig- 
nal, which is partially made up of all the individual re- 
flections from each of the features 72, 74, 76, will fluc- 
tuate in a seemingly random fashion, rather than the 
previously described periodic sinusoidal signal 

However, as alluded to above, the polishing rates 
of the features 72, 74, 76 tend to converge closer to the 
point of Dlanarization Therefore the difference in phase 
ar,d frequency between the interference beams pro- 
duced by the features 72 74 76 tend to approach zero. 
This results in the resultant interference signal becom- 
ing recognizable as a periodic sinusoidal wave form. 
Therefore, it is possible to determine when the surface 
of a patterned wafer has become planarized by detect- 
ing when a sinusoidal interference signal begins This 
method is illustrated in Fig 12 First, in step 402, a 
search is made for the aforementioned sinusoidal vari- 
ation in the interferometer signal When the sinusoidal 
variation is discovered, the CMP procedure is terminat- 
ed as shown in step 404 

Fig 13 is a graph plotting the amplitude of the de- 
tector signal over time for a patterned wafer undergoing 
a CMP procedure The sampled data used to construct 
this graph was held at its previous integrated value until 



the next value was reported, thus explaining the 
squared-eff peak values shown A close inspection 
shows that a discernible sinusoidal cycle begins to 
emerge at approximately 250 seconds. This coincides 

5 with the point where the patterned wafer first became 
ptananzed Of course in real-time monitoring of the 
interferometer's output signal it would be impossible to 
know exactly when the cycling begins Rather at least 
some portion of the cycle must have occurred before it 

'0 can oe certain that the cycling has begun Preferably, 
no more than one cycle is allowed to pass before the 
CMP procedure is terminated. A one-cycle limit is a 
practical choice because it provides a high confidence 
that the cycling has actually begun, rather than the sig- 

*5 nal simply representing variations in the noise caused 
by the polishing of the differently sized features on the 
surface of the wafer. In addition, the one-cycle limit en- 
sures only a small amount of material is removed from 
the surface of the wafer after it becomes planarized it 

20 has been found that the degree of plananzation is es- 
sentially the same after two cycles, as it was after one. 
Thus, allowing the CMP procedure to continue would 
only serve to remove more material from the surface of 
the wafer Even though one cycle is preferred in the case 

25 where the CMP process is to be terminated once the 
patterned wafer becomes planarized, it is not intended 
that the present invention be limited to that time frame 
If the signal is particularly strong, it might be possible to 
obtain the same level of confidence after only a portion 

3Q of a cycle. Alternately, if the signal is particularly weak, 
it may take more than one cycle to obtain the necessary 
confidence The choice will depend on the characteris- 
tics of the system used For instance, the size of the gap 
between the quartz window and the surface of the wafer 

35 will have an effect on signal strength and so the deci- 
sion on how many cycles to wait before terminating the 
CMP process. 

The actual determination as to when the output sig- 
nal from the laser interierometer is actually cycling, and 

40 so indicating that the surface of the wafer has been 
planarized can be done in a variety of ways. For exam- 
p!e, the signal could be digitally processed and an =tiyo- 
ritnm employed to make the determination. Such a 
method is disclosed in U.S. patent 5.097,430, where the 

45 siope of the signal is used to make the determination. 
In addition, various well known curve fitting algorithms 
are available These methods would essentially be used 
to compare the interferometer signal to a sinusoidal 
curve. When a match occurs within some predeter- 

50 mined tolerance it is determined that the cycling has 
begun. Some semiconductor applications require tlidt 
the thickness of the material overlying a structure 
formed on a die of a patterned wafer (i e the film thick- 
ness) be at a certain depth, and that this film thickness 

55 be repeatable from die to die, and from wafer to wafer 
The oreviously described methods for piananzmg a typ- 
ical patterned wafer will not necessarily produce this de- 
sired repeatable film thickness The purpose of the 
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planari7ation methods is to create a smooth and flat sur- 
face no: to produce a particular Mm thickness. Accord- 
ingly, if rt is desirable to control the film thickness over 
a specific structure, or group of similarly sized struc- 
tures, an arternate method must be employee This al- 
ternate method is described below 

As alluded to previously ; each differently sized sur- 
face feature resulting from a layer of oxide being formed 
over a patterned structure on a die tends to produce a 
reflected interference signal with a unique frequency 
and phase. It is only close to the point of plananzation 
that the frequency and ohase of each differently sized 
feature converges Prior to this convergence the unique 
frequency and phase of the interference signals caused 
by the various differently sized features combine to pro- 
duce a detector signal that seems to vary randomly. 
However, it is possible to process this signal to eliminate 
the interference signal contributions of all the features 
being polished at different rates, except a particularly 
sized feature, or group of similarly sized features Once 
the interference signal associated with the particularly 
siZ9d feature, or group of features, has been isolated. 
The methods discussed in association with the removal 
of material from a blank oxide disk are employed to re- 
move just the amount of material necessary to obtain 
the desired film thickness 

Of course, the frequency of the interference signal 
component caused by the feature of interest must be 
determined prior to the signal processing. It is believed 
this frequency can be eastly determined by performing 
a CMP process on a test specimen which includes dies 
exclusively patterned with structures corresponding to 
the structure which is to have a particular overlying film 
thickness. The detector signal produced during this 
CMP process is analyzed via well known methods to de- 
termine the unique frequency of the interference signal 
caused by the surface features associated with the 
aforementioned structures. 

The specific steps necessary to perform the above- 
described method of controlling the film thickness over 
a specific structure, or group of similarly sized structures 
on a die, in situ, during the CMP processing of a wafer, 
v^ill new be described in reference to Fig. 14. in step 
502. the detector siqnai is filtered to pass only the com- 
ponent of the signal having the predetermined frequen- 
cy associated with the structure of interest This step is 
accomplished using well known band pass filtonng toch- 
mques Next, in step 504 a measurement is made of the 
time between the first occurrence cf a maxima and mini- 
ma, or vice versa, in the detector signal (although an 
entire cycle or any portion thereof could have been em- 
ployed) The amount cf material removed during that 
portion of the cycle (i e one-half cycle) is determined in 
step 506 via previously described methods Then, a re- 
moval rate is then calculated by dividing the amount of 
material removed by the measured time, as shown in 
step 506 This constitutes the rate at which material was 
removed in the preceding portion of the cycle In the next 



step 510, the thickness of the material removed as cat- 
enated m step 506 is subtracted from the desired thick- 
ness to be removed (i.e. the thicKress which when re- 
moved will result in the desired film thickness overlying 

s tne structure of interest), to determine a remaining re- 
moval thickness Then, this remaining removal thick- 
ness is divided by the aforementioned removal rate to 
determine how much longer the CMP process is to be 
continued before it termination, in step 512. Once a re- 

*o mamingtimehas been calculated the process is repeat- 
ed for each occurrence of a maxima and minima or vice 
versa Accordingly the time between the next occurring 
maxima and minima is measured, the thickness of ma- 
terial removed during the portion of the cycle represent- 

1 $ ed by this occurrence of the maxima and minima (i.e. 
one-half) is divided by the measured time, and the re- 
moval rate is calculated, just as in the first iteration of 
the method. However in the next step 51 0 as shown in 
brackets, the total amount of material removed during 

20 all the previous iterations is determined before being 
subtracted from the desired thickness. The rest of the 
method remains the same in that the remaining thick- 
ness to be removed is divided by the newly calculated 
removal rate to determine the remaining CMP process 

25 time This process is repeated until the remaining time 
expires before the next iteration can begin At that point, 
the CMP process is terminated, as seen in step 514 

it is noted that although the method for controlling 
film thickness described above utilizes the method for 

30 determining the CMP process endpomt illustrated in Fig. 
10B, any of the other endpomt determination methods 
described herein could also be employed, if desired 

It is further noted that the beam diameter (i e spot) 
and wavelength of the laser beam generated by the la- 
ss ser interferometer can be advantageously manipulated 
As shown in Figs. 15A and 1 5B. a narrow beam 84, such 
as one focused to the smallest spot possible for the 
wavelength employed, covers a smaller area of the sur- 
face of the wafer 14 than a wider, less focused beam 

*o 86 This narrow beam 84 is more susceptible to scatter- 
ing (i.e. beam 68) due to surface irregularities 90, than 
the wider beam 86, since the wider beam 86 spreads 
oui over more of the surface area of the wafer 14 and 
encompasses more cf the surface irregularities 90. 

4S Therefore a wider beam 86 would have an intearatmg 
effect and would be less susceptible to extreme varia- 
tions in the reflected interference signal, as it travels 
acoss the surface of the wafer 1 4. Accordingly, a wider 
beam 86 is preferred for this reason The laser beam 

50 width can be widened using well known optical devices 
It must also be pointed out that the wider beam will 
r educe the available data acquisition time per olaten 
revolution since the time in which the beam is complete- 
ly contained within the boundaries of the window is less 

$ 5 than it would be with a narrower beam. However, with 
the previousfy described methods of data acquisition, 
this should not present a significant problem. In addition, 
since the wider beam also spreads the light energy out 
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over a larger area than a narrower beam the intensity 
of the reflections will be lessen somewhat This draw- 
back can be remedied by increasing the power of the 
laser beam from the laser interferometer so that the loss 
in intensity of the reflected beams is not a factor m de- 
tection 

As for the wavelength of the laser beam, it is feasi- 
ble to employ a wavelength anywhere from the far infra- 
feci to ultraviolet However, it is preferred that a beam in 
the red light range be used. The reason for this prefer- 
ence is two-fold. First, shorter wavelengths result in an 
increase in the amount of scattering caused by the 
chemical slurry because this scattering is proportional 
to the 4th power of the frequency of the laser beam. 
Therefore, the longer the wavelength, the less the scat- 
tering. However, longer wavelengths also result in more 
of the oxide layer being removed per period of the inter- 
ference signal, because the amount of materia! re- 
moved per period equals approximately 7J2n. There- 
fore, the shorter the wavelength, the less material re- 
moved in one period, it is desirable to remove as little 
of the material as possible during each period so that 
the possibility of any excess material being removed is 
minimized. For example, in a system employing the pre- 
viously described method by which the number of cy- 
cles, or a portion thereof, are counted to determine the 
thickness of the oxide layer removed, any excess ma- 
terial removed over the desired amount would be mini- 
mized if the amount of material removed during each 
cycle, or portion thereof, is as smalt as possible. 

It is believed these two competing factors in the 
choice of wavelength are optimally balance if a red light 
laser beam is chosen Red light offers an acceptable de- 
gree of scattering while not resulting in an unmanagea- 
ble amount of material being removed per cycle. 

Further Embodiments 

The generated interference waveform provides 
considerable additional information about the polishing 
process This additional information can be used to pro- 
vide an in-situ measurement of tne uniformity of the pol- 
ished layer !t can also be used to detect when the CM^ 
system is not operating wrthtn spec (i.e., noi operating 
as desired). Both of these uses will now be described. 

Uniformity Measurement. 

The polishing and/or piananzation operations which 
are performed on the CMP system are generally re- 
quired to produce a surface iayer that is uniform across 
the surface of the wafer/substrate. In other words, the 
center of the wafer should polish at the same rate as the 
edge of the wafer. Typically, the thickness of the pol- 
ished layer must not vary by more than about 5-10%. If 
that level of uniformity is not achieved, it is likely that the 
wafer will not be usable since the device yields will be 
unacceptably low. In practice, it is often quite difficult to 



achieve a uniform polishing rate across the wafer It typ- 
ically requires optimizing many different variables to 
Keep it performing within the specs The end point de- 
tector described above provides a very useful tool for 
s monitoring the uniformity of the iayer being polished and 
that monitoring can be performed both in-situ data ac- 
quisition and processing 

We have discovered that the interference waveform 
that is produced by the interferometer during polishing 

J0 provides information about the uniformrty of the layer 
that is being polished. As noted above the output of the 
interferometer appear as a sinusoidal signal as the sur- 
face layer (e g oxide layer) is being polished The dis- 
tance between the peaks of that signal indicate how 

1 $ much material has been removed. On top of that sinu- 
soidal signal there will also be another higher frequency 
sinusoidal signal. The amplitude of the higher frequency 
signal indicates by how much the thickness of the pol- 
ished layer varies across the surface of the wafer. 

20 The reason that the high frequency signal appears 
is as follows. As the polishing is being performed, the 
interferometer typically samples (or looks at) different 
locations across the surface of the wafer. This is be- 
cause during polishing., both the platen and the wafer 

2S are rotating and in addition the wafer is also being 
moved axially relative to the platen Thus, during polish- 
ing different areas of the wafer's surface pass over the 
hole in the platen through which the interferometer sees 
the layer that is being polished. If the polished layer is 

30 completely uniform, the resulting interference waveform 
will be unaffected by the sampling of the different loca- 
tions across the wafer's surface. That is, it will have sub- 
stantially the same amplitude. On the other hand, if the 
polished layer is not uniform, the sampling of different 

35 locations introduce, a further variation onto the sinusoi- 
dal base signal. This further variation has a frequency 
that is dependent on the rotation and sweep rates that 
are used and it has an amplitude that is proportional to 
the degree of nonuniformity of the polished layer An ex- 

40 ample of such a waveform is shown in Fig. 16 In this 
particular example the nonuniformity was relatively 
iarge so as to clearly illustrate the high frequency signal. 

A measure o* the un^omnity ! s the ratio o* t h e peak- 
io-peak amplitude A hf of tne high frequency signal to the 

4£ peak-to-peak amplitude A, f of the low frequency signal. 
Tne smaller this ratio, the more uniform the polished lay- 
er wii! be, and conversely, the larger this ratio, the more 
nonuniform it will be. 

A CMP system which produces a measure of uni- 

so formity is shown in Fig. 17. In addition to the comDonents 
shown in the previously described Fig. 2. it aiso includes 
a computer 1 50, which is programmed to control the op- 
eration of the interferometer and to perform the signal 
analysis that is required to produce a measure of uni- 

55 formity from the interference signal, and it includes a dis- 
play unit 160 through which various information and re- 
sults are displayed to an operator Computer 150 can 
be any device which is capable of performing the control 
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ard signal processing functions ncluding, for example, 
a standard PC which is programmed appropriately and 
a dedicated, specially designed digital processing unit. 
Display unit 1 60 can be a video display a printer, or any 
appropriate device or combination of devices for com- 5 
municatmg information to the operator of the CMP sys- 
tem 

To generate a uniformity measure, computer 1 50 is 
programmed to implement and perform the signal 
processing and other functions shown in Fig 18 In that 1 0 
regard, computer 150 implements two programmable 
bandpass filters, namely, a high frequency fitter 1 52 and 
a low frequency filter 154 High frequency filter 152 has 
a passband centered on the frequency of the high fre- 
quency signal containing the uniformity information and 1 $ 
low frequency filter 1 54 has a passband centered on the 
frequency of the lowfrequency signal containing the pol- 
ishing rate information. The width of both of these pass- 
bands is on the order of a few milliherz in the case when 
the period is on the order of tens of seconds. Indeed, 20 
the width of the passband is programmed to vary in pro- 
portion with the center frequency 01 stated differently, 
to vary inversely to the period of the signal being exam- 
ined. That is, if the period of the relevant signal increas- 
es, the bandwidth of the passband filter decreases and 25 
vice versa. 

Fig 19(a) shows an example of an interferometer 
signal obtain from an actual system Note that initially 
the signal indicates that the layer is quite uniform, i e , 
no discernible high Irequency signal is riding on top of 30 
the low frequency signal. After polishing has been per- 
formed for a short period of time, a high frequency signal 
begins to appear, indicating a certain level of nonuni- 
formity. Lowfrequency filter 154 selects the lowfrequen- 
cy component and filters out the other frequencies to 35 
produce an output signal of the 1orm shown in Fig 19 
(b). Similarly high frequency filter 152 selects the high 
frequency component and filters out the other frequen- 
cies to produce an output signal of the form shown in 
Fig 19(c). 40 

Computer 1 50 implements two amplitude measure- 
ment functions 156 and 158 which measure the peak- 
to-peak amplitudes of the output signals of filters 152 
and ">54; 'espective'y. Once the amplitudes cf the two 
filtered signals has been determined, computer 150 4 $ 
computes a ratio cf the p-p amplitude of the high fre- 
quency signal to the p-p amplitude of the low frequency 
signal (i.e.. A hf /A jf ) (see functional block 162) After the 
ratio has been computed, computer 1 50 compares (see 
Diock 166) the computed ratio to a threshold or refer- so 
Anre value 1 64 that was previously stored in local mem- 
ory If the computed ratio exceeds the stored threshold 
value, computer 150 alerts the operator that nonuni- 
formity of the polished layer exceeds an acceptable 
amount. In response the operator can adjust the proc- 55 
ess parameters to bring the process back into spec 

Since the high frequency signal tends to appear on- 
ly alter some polishing has been performed, rt is useful 



to wait before attempting to measure nonuniformity in- 
deed, it may be desirable to automatically compute the 
ratio periodically so as to monitor the uniformity of the 
polished layer throughout the polishing operation In that 
case it may also be desirable for computer 1 50 to output 
the computed ratios throughout the process so that the 
operator can detect changes and/or trends which are 
appearing in the polishing process. This would be par- 
ticularly useful rf the in-situ monitoring whs done during 
on actual production wafers during polishing 

Note that the functions just described can be imple- 
mented through software that is running on the compu- 
ter or they can be implemented through dedicated cir- 
cuits built for this specific purpose 

The bandpass filters can bo implemented using 
techniques which are well known to persons skilled in 
the art In the described embodiment, they are FIR (finite 
impulse response) filters which can be implemented in 
either the frequency or the time domain However to 
perform the filtering in real time as the interferometer 
signal becomes available, the filtering is done in the time 
domain oy convolving the appropriate function with the 
waveform as it is being generated. The appropriate 
function is, of course, simply the time domain represen- 
tation of a bandpass filter having the desired character- 
istics (i.e., center frequency and bandwidth) 

To specify the appropriate filter parameters it is nec- 
essary to know the frequency of the signal that is to be 
selected by the filter. This information can bo obtained 
easily from the interferometer signal waveform(s). For 
example, the center frequency for the low frequency fil- 
ter can be obtained by running a batch (e g, 25) of wa- 
fers (e g blank wafers with only an oxide coating) to ob- 
tain an accurate measure of the polishing rate Alterna- 
tively, the polishing rate can be determined at the start 
of a polishing run by measuring the distance between 
peaks of the low frequency signal. Of course, using this 
alternative approach produces results that are not as ac- 
curate as averaging measurements over a larger 
number of wafers. In any case, the polishing rate deter- 
mines the center frequency of the bandpass filter and 
by knowing the center frequency along with the desired 
bandwidth of the filter one can readi'y determine the pre 
Cise form of the time domain filter function and/or tne 
coefficients of the FIR filter 

The frequency ot the high frequency signal can be 
ooiained in a similar manner i.e.. directiy from the trace 
that is generated by tho interferometer as the CMP sys- 
tem is polishing the wafer In other words, the operator 
simply measures the distance between peaks of the 
high frequency signal. This process can be readily au- 
tomated so that the operator with the aid of a pointing 
device (e g a mouse), can mark two points on the wave- 
form appearing on a video display and the computer can 
bo programmed to automatically compute tho frequency 
and then generate the appropriate filter coefficients The 
filter coefficients and/or time domain representation of 
thy filler functions are then stored m local memcry for 
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use later during the polishing runs to perform the filtering 
operations. 

Process Signature : 

s 

The interferometer waveform also represents a sig- 
nature of (i.e., it characterizes) the system for which it 
was obtained. Because of this, it provides information 
which is useful for qualifying a system for production op- 
eration. If a signature is obtained for a system that is -0 
known to be operating as desired, that signature wave- 
form (or features extracted from the waveform) can be 
used as a reference against which subsequently gener- 
ated signatures can be compared to determine whether 
the system or systems from which signatures were sub- 15 
sequentiy obtained are performing within spec. For ex- 
ample, if the polishing pads are changed or a new batch 
of slurry is used in the CMP system, the operator needs 
to know whether that change has detrimentally affected 
the quality of the polishing which the system performs 20 
We have discovered that a change in performance of 
the CMP system results in a change in the signature. 
That !s. certain features will appear in the waveform that 
were not previously present or previously existing fea- 
tures will change. By detecting those changes, it is pos- 2s 
sible to detect when a system is not performing as de- 
sired 

In the described embodiment, the extracted fea- 
tures from the interferometer waveform are the polishing 
rate and the measure of uniformity. Both of these char- JO 
actenstics are readily obtainable from the interferometer 
wavelorm that is generated during polishing by using the 
methods previously described A properly operating 
system will produce a particular polishing rate and a par- 
ticular measure of uniformity A drift away from these 35 
reference values provides an indication that the system 
is moving away from its desired operating point and 
alerts the operator to the need for corrective action so 
as to avoid destroying product. 

A method which uses a CMP system signature is 40 
illustrated in Fig. 20a and will now be described. Initially, 
an interferometer waveform (s.s a signature) is gener- 
ated for a CMP system which is known to be operating 
optimally (step 250} The decision as to whether the sys- 
tem is operating optimally can be determined empirically 4 & 
by processing a set of test wafers and analyzing the re- 
sults. When the results that arc produced are within 
spec, then the signature can be generated for that con- 
figuration and set of operating conditions. Before cap- 
turing a portion of the interferometer waveform, it is de- 50 
sirable tc polish the wafer between 5C-100% of the way- 
through the oxide so that the waveform is truly a signa- 
ture of the polishing set up 

After the waveform has been obtained, certain rel- 
evant features are then extracted from the generated 55 
waveform (step 252) and stored for later use as a refer- 
ence against which to evaluate that system's perform- 
ance at some later time or times (step 254) Alternative- 



ly, the waveform itself can be stored and used as the 
reference in the described embodiment, the extracted 
features are the polishing rate and the measure of uni- 
formity both of which can be determined from the wave- 
form as described above 

Referring to Fig 20b at some later time the stored 
signature (or extracted features) can be used to qualify 
that system or another system for production use To 
qualify a system for production, a new signature is ob- 
tained for that system (step 253) and the relevant fea- 
tures are extracted from that new signature (step 260) 
The extracted features are then compared to tne stored 
reference set of features (step 264) If the operating 
point, as characterized by the set of extracted features, 
falls within a predetermined region around the reference 
point, as defined by the stored reference set of features, 
then it is concluded that the system is operating properly 
and that it can be Drought online for processing product 
waters (step 266) if this process is automated The com- 
puter may at this point alert the operator that the process 
is within spec. On tho other hand, if the operating point 
falls outside of the predetermined region, that is an in- 
dication that the system is not operating within spec and 
the operator is alerted to this problem so that corrective 
action can be taken (step 268). The corrective action 
might mvorve adjusting some process parameter appro- 
priately to bring the process within spec For example, 
if the polishing rate is excessive or if oxide nonuniformity 
ts larger than permitted then the operator may recog- 
nize that it is appropriate to try a new batch of slurry, or 
to adjust the pressure on the pad, or to even replace the 
pad The particular course of corrective action that is 
chosen will of course depend upon the details of how 
the system has departed from its desired operating 
point the configuration and operating parameters of the 
particular system, and what the operator's experience 
has taught him. 

To provide further useful information to the operator 
the computer also optionally outputs through its display 
device(s) information about the extracted features (step 
262). The displayed information may be presented as 
the exti -luted fydtuf et>, the waveform, how close the var- 
ious extracted features are to the different features of 
the stored reference set, or in wnaiever manner proves 
to be most useful for the operator. 

Of course, the above-described in-situ, real time 
monitoring procedure can be used periodically while 
processing production wafers or whonovor seme proc 
ess parameter is changed in the CMP system (e.g. a 
new polishing pad is used, pad pressure is adjusted, or 
a new batch of slurry is used} and it becomes t )ecebi>ary 
to know that the CMP process is still within spec In ad- 
dition it can be used on blank wafers instead of actual 
product, to qualify the CMP system prior to using it on 
actual product. 

Though we have described a straight forward and 
simple approach to extracting information from the sig- 
nature waveform i.e., by using the polishing rate and 
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the measure of uniformity, the signature or interferom- 
eter waveform can be analyzed by using more sophis- 
ticated techniques (e g pattern or feature recognition or 
other image analysis algorithms, or neural networks, 
just to name a few alternatives) The information which s 
various extracted features convey regarding the opera- 
t.on of the system can be determined through expen- 
erce and the ones which convey the information that is 
perceived to be of most importance to the operator can 
be used 10 

Also, it should be noted that simply displaying the 
interferometer waveform (i.e., the process signature) to 
the operator can be yield valuable feedback on how well 
the system is behaving. Typically, the human eye is ex- 
tremely sensitive in detecting even subtle changes in an 1 $ 
image from what one expects to see. Thus, after gaining 
some expenence, the operator will often be able to de- 
tect changes and imminent problems in the overall CMP 
system performance simply by looking at the waveform. 
Thus, in the described embodiment the computer also 20 
displays the signature waveform to the operator during 
processing so that the operator can also use it to monitor 
equipment performance. 

Using techniques known to persons skilled in the 
art, one can readily develop software algorithms which 25 
automatically recognize or detect the changes for which 
the operator is looking and which tip off the operator to 
uuitain problems. 

A Modification for Obtaining Improved Performance 30 

Another embodiment involves a modification to the 
window in the pad between the interferometer and the 
wafer Although the pad will transmit a substantial por- 
tion of the interferometer laser beam, it has been found 35 
that there is also a significant reflective component from 
the bottom surface of the pad. This situation is illustrated 
in Fig 21 (a) where part of the laser beam 34 emanating 
from the laser interferometer 32 is transmitted through 
Ihe pad 22 to term a transmitted beam 702, and part of 40 
the laser beam 34 is reflected from the backside surface 
7C4 cf tho pad 22 to form a reflected beam 706. The 
reflected beam 706 creates a considerable direct cur- 
rent (DC) shift in the data signal. Fig 21(b) illustrates 
this shift (although exaggerated for purposes of clarity) *s 
inn this example, the DC shift resulting from the reflect- 
ed laser light adds about S O volts to the overall signal 
The DC shift creates problems in analyzing the useful 
portion of the data signal. For example, if the data anal- 
ysis equipment operates in a range of 0-1 0 volts ampli- so 
fication of the DC shifted signal to enhance the portion 
of interest is all but impossible without reducing or elim- 
inating the DC component of the signal If the DC com- 
ponent is not eliminated, the equipment would be satu- 
rated by the amplified signal Reducing or eliminating 55 
the DC component electronically requires added signal 
processing electronics and may result in a degradation 
of the useful portion of the signal Even if the DC shift is 



not as large as described here, some signal processing 
will still likely be required to e'iminate >t Accordingly a 
non-electrcnic method of reducing or eliminating this jn- 
wanted DC component is desirable 

It nas been found that by creating a diffuse surface 
704' on the backside of the pad 22 in the area constitut- 
ing the window, as depicted in Fig 21(c). the reflected 
light from that surface is attenuated Thus, the unwanted 
DC component of the data signal is reduced The diffuse 
surface 704' in effect scatters the non-transmitted laser 
light 703 rather than reflecting most of it back towards 
the interferometer 32 The reflected signal from the wa- 
fer must also pass through the diffuse surface 704' and 
in doing so some of it will also be scattered However, it 
has been found that this does not seriously degrade the 
performance of the interferometer. 

Fig. 21 (d) illustrates the data signal obtained when 
the diffuse surface 704* is employed As can be seen ; 
with the elimination of the DC component, the signal can 
be readily amplified and processed without the need to 
electronically eliminate any DC portion. 

Hcwthe diffuse surface is produced is not of central 
importance It can be produced by sanding the back sur- 
face of the polishing pad in the vicinity of the window or 
by applying a material coating which is diffuse (e.g. 
Scotch tape), r in any other way that produces the de- 
sired results. 

While the invention has been described in detail by 
reference to the preferred embodiments described 
above, it is understood that variations and modifications 
thereof may be made without departing from the true 
spirit and scope of the invention 



Claims 

1. An apparatus for chemical mechanical polishing 
(CMP) of a wafer, comprising: 

(a) a rotatable polishing platen with an overiy- 
pg polishing oad wetted with an abrasive slurry, 
the platen being rotatably mounted tc a chas- 
sis. 

(b) a rotatable polishing nead for holding the 
wafer against the polishing pad, the wafer com- 
prising a semiconductor substrate underlying 
an oxide layer and 

(c) an endpomt detector comprising, 

(cl ) a laser interferometer capable of gen- 
erating a laser beam directed towards the 
water and detecting fight reflected from the 
wafer, and. 

(c2) a window deposed adjacent to a hole 
formed through the platen, the window pro- 
viding a pathway for the laser beam to im- 
pinge on the wafer at least during part of a 
period of time when the wafer overlies the 
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window 

2. The apparatus for claim 1 . wherein the windowcom- 
pnses' 

an insert mounted within the platen hole the 5 
insert being transmissive to the laser beam 

3. The apparatus of claim 2. wherein the window com- 
prises 

a portion of the polishing pad, the portion be- *0 
mg at least partially transmissive to the laser beam 

4. The apparatus of claim 1 , wherein the window com 

prises. 

a plug formed in the pad, the plug being highly 1 $ 
transmissive to tne laser beam. 

5. The apparatus of claim 1 , wherein: 

the hole through the platen, and the window, 
are arc-shaped having a radius with an origin coin- 20 
cident with a centre coincident with a centre of ro 
tation of the platen. 

6. The apparatus of claim 1 , wherein the endpoint de- 
Tecto' further comprises: 2s 

a position sensor for sensing when the win- 
dow is adjacent the wafer such that the laser beam 
generated by the laser interferometer can pass un- 
blocked through the window and impinge on the wa- 
fer. 30 

7. The apparatus of claim 6, wherein: 

(a) the laser interferometer comprises means 

for producing a detection signal whenever light 3S 
reflected from the wafer is detected; 

(b) the position sensor comprises means for 
outputting a sensing signal whenever the win- 
dow is adjacent the wafer such that the laser 
beam generated by the laser interferometer *o 
can pass unblocked through the window and 
impinge on the wafer; and 

(c) the endpoint detector further comprises da- 
ta acquisition means connected to the laser 
interferometer and position sensor for sampling 45 
the detection signal from the laser interferom- 
eter for the duration of the sensing signal from 

the position sensor, the data acquisition means 
comprising a means for outputting a data signal 
representing the sampled detection signal. 5C* 

8. The apparatus of claim 7, wherein the data acqui- 
sition means comprises 

(a) means for integrating the sampled detection 55 
signal from the laser interferometer over a pre- 
determined period of time; and wherein 

(b) the outputting means outputs a data signal 
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representing the ntegrated samples of the de- 
tection signal 

9 The apparatus of claim 7, wherein the data signal 
output by the outputting means is cyclical, and 
wherein the endpoint detector further comprises 

(a) means for counting a number cf cycles ex- 
hibited by the data signal, 

(b) means for computing a thickness of material 
removed during one cycle of the data signal 
from the wavelength of the laser beam and the 
index of refraction cf the oxide layer of the wa- 
fer; 

(c) means for comparing a desired thickness of 
material to be removed from the oxide layer, to 
a removed thickness comprising a product of 
the number of cycles exhibited by the data sig- 
nal from the counting means and the thickness 
oT material removed during one cycle from the 
computer means and. 

(d) means for terminating the CMP whenever 
the removed thickness equals or exceeds the 
des'red thickness of material to be removed. 

10. The apparatus of claim 7, wherein the data signal 
output by the outputting means is cyclical, and 
wherein the endpoint detector further comprises: 

(a) means for measuring the time required by 
trie data signal to complete one of (i) a pre- 
scribed number of cycles, or (n) a prescribed 
portion of one cycle, after each occurrence 
thereof; 

(b) means for computing a thickness of material 
removed during the time measured by the 
measuring means from the wavelength of the 
laser beam and the index of refraction o1 the 
oxide layer of the wafer. 

(c) means for calculating a rate of removal, said 
calculating means dividing the thickness of ma- 
terial removed by the time measured obtained 
from the measuring means. 

(d) means for ascertaining a remaining removal 
thickness, said ascertaining means subtracting 
a cumulative thickness of material removed 
from a desired thickness of material to be re- 
moved from the oxide layer sa'd cumulative 
thickness cf material removed being provided 
by a means for summing the thickness of ma- 
teria! removed commuted Dy the computmq 
means after each occurrence of the one of (i) a 
prescribed number of cycles, or (n) a prescribed 
portion of one cycle. 

(e) means for establishing a remaining CMP 
time said establishing means dividing the re- 
maining removal thickness by the rate of re- 
moval, and 
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(f) moans for terminating the CMP after the ex- 
piration of the remaining CMP time 

1 1 . Tne apparatus of claim 7, wherein the wafer has an 
initially irregular surface topography and is s 
piananzed during the CMP, and the data signal out- 
put by the outputting means is cyclical only after the 
wafer surface is planarized, and wherein the end- 
poirit detector further comprises. 

10 

(a) means for detecting cyclic variation :n the 
data signal; and 

(b) means for terminating the CMP whenever 
the detecting means detects the cyclic variation 

in the data signal. 15 

12. A method for chemical mechanical polishing (CMP) 
of a wafer comprising a semiconductor substrate 
underlying an oxide layer, the method comprising 

the steps of- 20 

(a) holding the wafer in a rotatable polishing 
head against a polishing pad of an underlying 
rotatable polishing platen : the pad being wetted 
with an abrasive slurry; 2S 

(b) determining an endpomt whereat the CMP 
is terminated, the determining step comprising 
the steps of, 

(b1) generating a laster beam directed to- 30 
wards the wafer, the laser beam passing 
through a window disposed adjacent a hole 
formed through the platen, the window pro- 
viding a pathway for the laser beam to im- 
pinge on the wafer at least during part of a 35 
period of time when the wafer overlies the 
window, and, 

(b2) detecting light reflected from the wa- 
fer 

40 

13. The method of claim 12. wherein the determining 
siep further comprises the step of . 

sensing when the window is adjacent the wa- 
fer such that the iaser beam can pass unblocked 
through the window and impinge on the wafer. 

14. The method of claim 13. wherein: 

(a) a detecting step comprises the step of pro- 
ducing a detection signal whenever light re- 50 
fleeted from the wafer is detected. 

(b) the sensing step comprises the step of out- 
putting a sensing signal whenever the window 
is adjacent the wafer such that the laser beam 
generated by the laser interferometer can pass 55 
unblocked through the window and impinge on 

the water; and, 

(c) the determining step further comprises the 



step of data acquisition the data acquisition 
step comprising. 

;c1 ) sampling the detection signal from the 
laser interferometer for the duration of the 
sensing signal, and 

(c2) outputting a data signal representing 
the sampled detection signal 

15. The method of claim 14 wherein the data acquisi- 
tion step further comprises 

(a) integrating the sampled detection signal 
over a predetermined period of time; and 
wherein. 

(b) the outputting step comprises outputting a 
data signal representing the integrated sam- 
ples of the detection signal. 

16= The method of claim 14 wherein the data signal is 
cyclical, and wherein the determining step further 
comprises the steps of. 

(a) counting a number of cycles exhibited by the 
data signal; 

(b) computing a thickness of material removed 
during one cycle of the data signal from the 
wavelength of the laser beam and the index of 
refraction of the oxide layer of the wafer: 

(c) comparing a desired thickness of materials 
to be removed from the oxide layer, to a re- 
moved thickness comprising a product of the 
number of cycles exhibited by the data signal 
and the thickness of material removed during 
one cycle; and, 

(d) terminating the CMP whenever the removed 
thickness equals or exceeds the desired thick- 
ness of material to be removed. 

17. The method of claim 14, wherein the data signal is 
cyclical and wherein the determining step further 
comprises the steps of. 

(a) measuring the time requirea for the data sig- 
nal to complete one of (i) a prescribed number 
of cycles, or (n) a prescribed portio of one cycle 

(b) computing a thickness of malenai removed 
during the time measured from the wave- 
lengths of the iaser beam and the index of re- 
fraction of the oxide layer of the wafer 

(c) calculating a rate of removal cy dividing me 
thickness of material removed by the time 
measured. 

fd) ascertaining a remaining removal thickness 
by subtracting the thickness of mater al re- 
moved from the desired thickness of material 
to be removed from the oxide layer 
ie) establishing a remaining CMP time by divid- 
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ing the remaining removal thickness by the rate 
of removal, ana 

(f) terminating the CMP after the expiration of 
the remaining CMP time 

18 The method of claim 14 wherein the wafer has an 
initially irregular surface topograpny and is 
piananzed during the CMP, and the data signal is 
cyclical only after the wafer surface ss piananzed, 
and wherein the determining step further comprises 
the steps of: 

(a) searching for a cyclic variation in the data 
signal; and 

(b) terminating the CMP whenever the cyclic 
variation in the data signal is discovered 

19. The method of claim 14, wherein the wafer has an 
initially irregular surface topography, and the data 
signal is cyclical only after the wafer surface is 
p!ar,arizcd, and wherein the determining step fur- 
ther comprises the steps of. 

(a) filtering the data signal to pass only a com- 
ponent of the data signal having the predeter- 
mined frequency; 

(b) counting a number of cycles exhibited by the 
filtered data signal; 

(c) computing a thickness of material removed 
during one cycle of the filtered data signal from 
the wavelength of the laser beam and the index 
of refraction of the oxide layer of the water 

(d) comparing a desired thickness of material 
to be removed from the oxide layer, to a re- 
moved thickness comprising a product of the 
number of cycles exhibited by the filtered data 
signal and the thickness of material removed 
during one cycle; and, 

(e) terminating the CMP whenever the removed 
thickness equals or exceeds the desired thick- 
ness of material to be removed 

20. The method of claim 14 wherein the wafer has an 
initially irregular surface topography, ana the data 
signal is cyclical only after the wafer surface Is 
pianarized, and wherein the determining st9D fur- 
ther comprises the steps of. 

(a) filtering the data signal to pass oniy a com- 
ponent of the data signal having the predeter- 
mined frequency; 

(b) counting a number of occurrences of a por- 
tion of a cycle exhibited by the filtered data sig- 
nal, 

(c) computing a thickness of material removed 
during the portion of the cycle of the filtered da- 
ta signal from the wavelength of the laser beam 
and the index of refraction of the oxide layer of 



:ho wa f er; 

(d) comparing a desired thickness of material 
to be removed from the oxide layer to a re- 
moved thickness comprising a product of tne 

5 number of occurrences of the portion of the cy- 

cle exhibited by the filtered data signal and the 
thickness of material removed during the por- 
tion of the cycle, and 

(e) terminating the CMP whenever the removed 
?o thickness equals or exceeds the destred thick- 
ness of material to be removed. 

21 . An in-situ method of measuring uniformity of a layer 
on a substrate during polishing of satd layer said 
1 S method comprising: 

(a) directing a light beam toward said layer dur- 
ing polishing; 

(b) monitoring an interference signal produced 
20 by said light beam reflecting off said substrate; 

and 

(c) computing a measure of uniformity from said 
interference signal. 

25 22. The method of claim 21 wherein said computing 
step comprises: 

extracting feature information from said inter- 
ference signal; and 
30 computing said measure of uniformity from said 

extracted feature information. 

23. The method of claim 22, further comprising compar- 
ing said measure of uniformity to a reference and 

■35 generating an alert when said measure of uniformity 
diverges from the reference by more than a prede- 
termined amount. 

24. The method of claim 22, wherein said interference 
40 signal includes a low frequency component and 

wherein said extracting step comprises 

measuring a first characteristic of said iow fre- 
quency component; and 
45 deriving said extracted information from said 

first characteristic 

25. The method ot claim 24 : wherein said interferom- 
eter signal includes a high frequency component 

t>o and wherein said extracting step further comprises 

measuring a second characteristic of said high 
frequency signal: and 

deriving said extracted information from said 
55 first and second characteristics 

26. The method of claim 25. wherein said first charac- 
teristic is an amplitude of said high freouency signal 
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and said second characteristic is an amplitude of 
said low frequency signal, and wherein said deriv- 
ing step comprises computing a ratio ot the ampli- 
tudes of satd high and low frequency signals. 

5 

27 An in-situ method of characterising a process for 
polishing a substrate having a layer formed thereon, 
said method comprising. 

(a) directing a light beam toward said layer dur- 1 o 
mg polishing; 

(b) monitoring an interference signal produced 
by said light beam reflecting off of said sub- 
strate; 

(c) extracting a signature from said interference is 
signal; 

(d) comparing said extracted signature with 
stored information, wherein said stored infor- 
mation represents a desired operating point for 
the polishing process; and 20 

(e) when said extracted signature diverges 
from said stored information by more than a 
predetermined amount, generating an alert sig- 
nal 

2S 

28. The method of claim 27, further comprising displac- 
ing the interference signal on a visual display device 
for an operator to view. 

29. The method of claim 27, wherein said step ol ex- 30 
tracting comprises: 

determining a polishing rate from said interfer- 
ence signal; and 

determining a measure of uniformity from said 35 
interference signal, 

and wherein said comparing step comprises 
comparing said polishing rate and said meas- 
ure of uniformity to said stored information. 

40 

30. A substrata polishing system comprising: 

fa) a platen which during processing holds a 
polishing pad, 

(b) a polishing head which during processing -#s 
holds the substrate against the polishing pad 

on the platen: 

(c) an interferometer capable of generating a 
collimated light beam which during processing 

is directed at and impinges on a side of said 50 

OuuOuaiU diui |J wUn l\j ^wnOl llu, jdiu I' lit! >t> 

ometer generating an interferometer signal; 
and 

fd) a data processing programmed to compute 

a measure of uniformity from said interference ss 

signal. 

31. The polishing system ot claim 30. wherein said pol- 



ishing head is rotatable so that it rotates during 
processing 

32. The polishing system of claim 31 , wherein said plat- 
en is rotatable so that it rotates during processing 

33. The polishing system of claim 32, wherein said 
interferometer is a iaser interferometer 

34. The polishing system of claim 32. wherein said data 
processor is further programmed to; 

extract feature information from said interfer- 
ence signal; and 

compute said measure of uniformity from said 
extracted feature information. 

35. The polishing system of claim 34. wherein said data 
processing is further programmed to: 

compare said measure of uniformity to a refer- 
ence: and 

generate an alert when said measure ot uni- 
formity diverges from the reference by more 
than a predetermined amount. 

36. The polishing system of claim 35, wherein said in- 
terference signal includes a low frequency compo- 
nent and wherein said data processor is pro- 
grammed to extract feature information by. 

measuring a first characteristic of said iow fre- 
quency component, and 
deriving said extracted information from said 
first characteristic 

37. The polishing system of claim 36, wherein said 
interferometer signal includes a high frequency 
component and wherein said data processor is pro- 
grammed 1o extract feature information by: 

measuring a second characteristic of said high 
frequency signal; and 

deriving said extracted informa lion iiG!m saio 
first and second characteristics. 

38 The polishing system of claim 37. whorcin said first 
characteristic is an amplitude of said high frequency 
signal and said second characteristic is an ampli- 
tude of said 'ow frequency signal, and wherein said 
data processor is programmed to derive said cx 
tracted information by computing a ratio of the am- 
plitudes of said high and low frequency signals 

39. An apparatus for polishing a substrate, said appa- 
ratus comprising: 

(a) a platen which during processing noids a 
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polishing pad said platen including a passage 
therethrough 

(b) a polishing head which during processing 
holds the substrate agamst the polishing pad 

on the platen; and s 

(c) an interferometer capable cf generating a 
ccllimated light beam, said interferometer ar- 
ranged to direct satd light beam at least during 
part of the polishing operation through said 
passage so that it impinges on said substrate 10 

(d) a window aligned wrth said hole and through 
which the light oeam passes on its way to the 
substrate, said window having a diffuse surface 
receiving the arriving light beam from the inter- 
ferometer. 15 

40. The apparatus of claim 39, further comprising the 
polishing pad mounted on said platen and wherein 
said window is formea in said pohshing pad and is 
aligned with said passage. 20 

41. A polishing pad for use in a chemical mechanical 
polishing system, satd polishing pad comprising 

(a) polishing surface; and 2s 

(b) a bottom surface, wherein said bottom sur- 
face and said polishing surface of said polishing 
pad includes a window formed therein, said 
window being transmissive to light from an 
inteiferometer and having a ditluse bottom sur- 30 
face 

42. An apparatus for polishing a substrate, said appa- 
ratus comprising: 

35 

(a) a platen which during processing holds a 
polishing pad r said platen including a window; 

(b) a polishing head which during processing 
holds the substrate against the polishing pad 

on the platen; and <*o 

(c) an interferometer capable of generating a 
ccllimated light beam, said interferometer ar- 
ranged to direct said light beam at least during 
part of the poiishing operation through said win- 
dow so that it impinges on said substrate and 45 
thereby provides for in-situ monitoring of the 
poiishing process 
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